Chemical structure recognition: a rule-based approach

نویسندگان

  • Noureddin M. Sadawi
  • Alan P. Sexton
  • Volker Sorge
چکیده

In chemical literature much information is given in the form of diagrams depicting molecules. In order to access this information diagrams have to be recognised and translated into a processable format. We present an approach that models the principal recognition steps for molecule diagrams in a strictly rule based system, providing rules to identify the main components — atoms and bonds — as well as to resolve possible ambiguities. The result of the process is a translation into a graph representation that can be used for further processing. We show the effectiveness of our approach by describing its embedding into a full recognition system and present an experimental evaluation that demonstrates how our current implementation outperforms the leading open source system currently available.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A New Statistical Approach for Recognizing and Classifying Patterns of Control Charts (RESEARCH NOTE)

Control chart pattern (CCP) recognition techniques are widely used to identify the potential process problems in modern industries. Recently, artificial neural network (ANN) –based techniques are very popular to recognize CCPs. However, finding the suitable architecture of an ANN-based CCP recognizer and its training process are time consuming and tedious. In addition, because of the black box ...

متن کامل

A Novel Approach to Conditional Random Field-based Named Entity Recognition using Persian Specific Features

Named Entity Recognition is an information extraction technique that identifies name entities in a text. Three popular methods have been conventionally used namely: rule-based, machine-learning-based and hybrid of them to extract named entities from a text. Machine-learning-based methods have good performance in the Persian language if they are trained with good features. To get good performanc...

متن کامل

A rule-based approach for recognition of chemical structure diagrams

In chemical literature much information is given in the form of diagrams depicting chemical structures. In order to access this information electronically, diagrams have to be recognised and translated into a processable format. Although a number of approaches have been proposed for the recognition of molecule diagrams in the literature, they traditionally employ procedural methods with limited...

متن کامل

Control Chart Recognition Patterns using Fuzzy Rule-Based System

Control Chart Patterns (CCPs) recognition is one the most important concepts in control chart application. Relating the patterns exhibited on the control chart to assignable causes is an ambiguous and vague task especially when multiple patterns co-exist. In this study, a fuzzy rule-based system is developed for X ̅ control charts to prioritize the control chart causes based on the accumulated e...

متن کامل

A Fugacity Approach for Prediction of Phase Equilibria of Methane Clathrate Hydrate in Structure H

In this communication, a thermodynamic model is presented to predict the dissociation conditions of structure H (sH) clathrate hydrates with methane as help gas. This approach is an extension of the Klauda and Sandler fugacity model (2000) for prediction of phase boundaries of sI and sII clathrate hydrates. The phase behavior of the water and hydrocarbon system is modeled using the Peng-Robinso...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2012